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(54) Title: DETECTING TUMOURS 



(57) Abstract 

A method of determining whether a human patient has 
a tumour or whether a tumour has metastasised comprising 
the steps of (1) obtaining a sample of tissue from the patient, 
the said tissue being one that does not normally contain 
a cytokeratin 20 (CK20) gene product and (2) determining 
whether a cytokeratin 20 (CK20) gene product is present in 
said tissue sample. Once it has been determined whether a 
human patient has a tumour or is Likely to develop metastatic 
disease from such a tumour, the physician can decide on an 
appropriate course of treatment. 
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DETECTI NG TUMOTTKS 

The present invention relates to methods of detecting tumours including 
metastatic disease in a patient, particularly metastatic disease of epithelial 
5 cell tumour origin, more particularly disseminating colon carcinoma. 

The development and growth of malignant tumours or cancers commonly 
results in the release of some of the cancerous cells from the developing 
tumour into the blood or other body fluids. These cells are then 

10 transported to other parts of the body where they may become implanted 
and set up secondary tumours or metastases, thus leading to a general 
dissemination or spread of the original cancer that is responsible for 
production of the primary tumour. The process of metastasis may 
commence at quite an early stage in the development and growth of the 

15 primary tumour, and in fact it is metastases, frequently haematogenous 
metastases produced by cancer tumour cells circulating in the peripheral 
blood, that determine the outcome of the disease for most patients. 

It is important to detect such cancer cells in body fluids, particularly 
20 peripheral blood, as this may aid in diagnosing the original cancer and 
monitoring the disease. In particular, detection of such cancer cells in the 
peripheral blood is an indicator of the likelihood of metastatic disease and 
is useful to the physician in deciding upon a suitable course of treatment. 

25 The number of such cancer or tumour cells circulating in such body fluids, 
particularly peripheral blood, is generally very smalt and they cannot 
therefore be distinguished and readily detected by routine microscopy. 
Techniques for their detection therefore need to be highly sensitive but 
must remain specific. 

30 
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Because of the huge number of blood cells compared with potential cancer 
or tumour cells it is very important that any marker that is used to detect 
the said cancer or tumour cells is not found in the normal blood cells. 

5 Various methods have been described previously in an attempt to develop 
methods of detecting metastatic disease by analysing blood and bone 
marrow samples for cancer cells. Moss & Sanders (1990) J. Clin. Oncol. 
8, 736-740 describes the detection of neuroblastoma cells in blood using 
monoclonal antibodies reactive against unidentified neuroblastoma cell 

0 antigens. 

Sawyers et al (1990) Proc. Natl. Acad. Sci. USA 87, 563-567 discloses a 
method of detecting chronic myelogenous leukaemia (CML) cells in the 
blood of a patient using the polymerase chain reaction (PCR). In this case 
cellular mRNA was extracted from blood samples and cDNA made using 
reverse transcriptase (RT). Thus, a RT-PCR was used to amplify cDNA 
corresponding to mRNA transcribed from the abnormal gene found in 
CML. 

GB 2 260 811 A discloses a general method for the diagnosis or 
monitoring of cancer of a malignant tumour using a RT-PCR. In this 
case, melanoma cells were detected in blood of a patient using a RT-PCR. 
Specific oligonucleotide primers were used to amplify tyrosinase mRNA. 
Tyrosinase is not expressed in normal blood but is expressed at a 
relatively high level in some melanoma cells (which are derived from 
melanocytes); however, analysis of tyrosinase gene expression is 
complicated by the presence of tyrosinase-related protein genes and the 
method is limited because some melanomas lack tyrosinase expression 
(amelanotic tumours). 
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Burchill et al (1994) Int. J. Cancer SI, 671-675 describes neuroblastoma 
cell detection by RT-PCR for tyrosine hydroxylase mRNA. 

It will be appreciated that melanoma, neuroblastoma and CML are 
5 tumours arising from highly specialised cell types and, in the case of 
CML, arising from a cell containing chromosomal abnormality. Some of 
the most prevalent cancers arise from less specialised epithelial cells, for 
example breast cancer and colon carcinoma. Clearly, any method of 
detecting cells in the blood derived from such epithelial cell tumours must 
rely on the marker to be detected not being expressed in normal -blood. 



10 



Cytokeratins (CKs) are components of mammalian cell cytoskeleton and 
constitute a multigene family of proteins (see Nagel (1988) Am. J. Surg. 
Pathol. 12, (suppl. 1), 4-16) and Moll et al (1982) Cell 31, 11-24 for 
15 reviews). CKs are expressed predominantly in epithelial cells where they 
show strict lineage and differentiation-associated patterns of expression. 
Malignant cells generally retain the CKs of their progenitor cell type and 
have been used previously to characterise neoplastic cells of epithelial 
origin. 

20 

Traweek et al (1993) Am. J. Pathol. 142, 1111-1118 have used RT-PCR 
to detect the expression of CK8, CK18 and CK19 in various tissues. CK8 
and CK18 are expressed in all tissue that has been studied including 
peripheral blood mononuclear cells, aspirated bone marrow cells and 
25 lymph nodes. Some of these cells are components of normal blood and 
so CK8 and CK18 are expressed in normal blood and, consequently, are 
of no use in analysing blood samples for the presence of cells derived 
from an epithelial cell tumour. 



30 Although Traweek et al apparently did not detect CK19 gene activity i 
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normal peripheral blood and bone marrow, they found that the CK19 
activity in stromal cells caused problems and that CK19 expression was 
detected in lymph nodes. Datta et al (1994) J. Clin. Oncol. 12, 475-482 
have used RT-PCR to analyse the expression of CK19 in breast cancer 
5 patients and, apparently, no gene expression was detected in the blood of 
patients at stages I, II or III but only ten of twenty-six patients with 
metastatic disease gave a positive result using the RT-PCR test. However, 
a low frequency of CK19 expression was found in normal blood indicating 
illegitimate transcription of CK19 mRNA or the presence of CK19- 
10 expressing cells in normal blood or bone marrow. Furthermore, as 
discussed in more detail in the Examples of the present invention, we have 
found CK19 expression using RT-PCR in six out of fifteen control blood 
samples. 

15 Thus, CK19, like CK8 and CK18, is not a suitable target for detection of 
tumour cells in peripheral blood. 

Surprisingly, we have found that CK20 is not expressed in normal blood 
samples. CK20 is expressed, for example, in a high proportion of 

20 colorectal carcinomas, stomach cancers, mucinous ovarian 
adenocarcinomas and transitional cell carcinomas. Thus, it is an object of 
the present invention to provide methods for detecting these and other 
tumours and, in particular, metastatic disease, by detecting the presence 
of CK20 gene expression in the blood or bone marrow or other suitable 

25 tissue of a patient. 

One aspect of the present invention provides a method of determining 
whether a human patient has a tumour or whether a tumour has 
metastasised comprising the steps of (1 ) obtaining a sample of tissue from 
30 the patient, the said tissue being one that does not normally contain a 
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cytokeratin 20 (CK20) gene product and (2) determining whether a 
cytokeratin 20 (CK20) gene product is present in said tissue sample. 



By "determining whether a tumour has metastasised" we include 
5 determining whether any tumour cells are found at a site remote from a 
primary tumour whether or not those tumour cells are found in a further 
tumour. 



Thus, the method includes determining the likelihood that a tumour has 
10 spread or is spreading or will spread in a patient. 

It will be appreciated that, in essence, the invention comprises determining 
whether a tissue sample from a patient contains a cytokeratin 20 (CK20) 
gene product, particularly a tissue sample which does not normally contain 
15 a cytokeratin 20 (CK20) gene product. 

By "CK20 gene product" we include CK20 mRNA and CK20 protein. 

It will be appreciated by a person skilled in the art that the human CK20 
20 gene, as is the case for many human genes, is polymorphic and therefore 
that many allelic forms occur. Thus, by "CK20 gene" we include all 
allelic forms. Different allelic forms can be readily detected by comparing 
one CK20 gene or mRNA sequence with another, for example by DNA 
sequencing or restriction fragment length polymorphism (RFLP) analysis. 

25 

We have found unexpectedly that a CK20 gene product is not present in 
blood or bone marrow from a human who does not suffer from a tumour 
or, particularly, in whom a tumour has not metastasised. Thus, it is 
preferred that the tissue sample is blood or bone marrow. Blood is readily 
30 obtained from the patient using venepuncture whereas bone marrow is 
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obtained using standard aspiration techniques (for example by placing a 
needle into the iliac bone and aspirating). Purged bone marrow is 
suitable. Alternatively, lymph nodes, peripheral stem cell harvests, urine 
and cystectomy samples may be used. Thus, we include urine in the term 
5 "tissue sample". A suitable sample also includes polymorphonuclear cells 
such as neutrophils. 

When we state that a CK20 gene product is not present in blood or bone 
marrow from a human who does not suffer from a tumour we mean that 
the amount of any CK20 gene product is so low that it cannot be detected, 
at least by presently available techniques. The amount of CK20 gene 
product present in the tissue (such as blood) of a human patient who is 
determined by the method of the invention to have CK20 gene product 
present in said tissue, compared to a human who does not have a CK20 
15 gene product (as defined), is at least two-fold higher, preferably at least 
10-fold higher, more preferably at least 100-fold higher and most 
preferably at least 1000-fold higher. 



10 



20 



It is preferred if the tumour is an epithelial cell tumour. 



There are many epithelial cell-derived tumours including breast carcinoma 
colorectal carcinoma, stomach adenocarcinoma, mucinous ovarian 
adenocarcinoma, all bladder carcinoma including dysplasia and transitional 
cell carcinoma. The vast majority of mucinous ovarian and colorectal 
25 adenocarcinomas express CK20; greater than 75% of stomach and gall 
bladder adenocarcinomas contain CK20-expressing cells; and more than 
half of pancreatic adenocarcinomas contain CK20-expressing cells. 



It is less preferred if the tumour is a lung tumour. 
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A high proportion of Merkel cell carcinomas and transitional cell 
carcinomas express CK20. 

Thus, the method of the invention is particularly suited to determine 
5 whether a patient has, or is likely to develop metastatic disease from, 
mucinous ovarian and colorectal adenocarcinomas. 

Determination of colorectal adenocarcinoma and metastasis therefrom are 
particularly preferred. 

10 

It is preferred that the patient has not suffered a local trauma or has not 
undergone surgery both of which may result in the release of epithelial 
cells into the blood. 

15 Presence of a cytokeratin 20 gene product can be determined in a tissue 
sample using various techniques. Conveniently, it is determined by 
detecting GK20 messenger RNA (mRNA). Although, in principle, CK20 
mRNA can be detected in a tissue sample by hybridising a specific nucleic 
acid probe to the mRNA (such as an antisense RNA or antisense 

20 oligonucleotide, the said probe being labelled with a readily-detectable 
moiety for example, a fluorescent dye or a radionuclide), the sensitivity 
of such a method may not allow the detection of the very small amount of 
CK20 mRNA that is present in a tissue sample. For example, a 5 ml 
blood sample may contain only tens of CK20-expressing cells derived 

25 from the epithelial cell tumour but tens of thousands of cells in total. 
Thus, it- is preferred that the mRNA is detected by, first, making a 
complementary DNA (cDNA) copy of the CK20 mRNA and, second, 
amplifying the cDNA using DNA amplification methods. 

30 In a particularly preferred embodiment the following steps are used: 
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1. A tissue sample is obtained from the patient. Conveniently, 2 ml 
of blood is removed from the patient and frozen at -80°C. 

2. Total cellular RNA is extracted from the blood sample, for example 
5 using standard techniques described in Sambrook et al (1989) 

Molecular Biology, a laboratory manual. Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, USA. Alternatively, 
commercially available kits can be used such as Ultraspec™ RNA 
(a trade mark) available from Biogenesis, Bournemouth, UK. 



10 



15 



The mRNA is converted into cDNA using suitable oligonucleotide 
primers, deoxynucleotides and an enzyme with reverse transcriptase 
activity. The oligonucleotide for making cDNA can be, for 
example, either: 



a) An oligo-dT primer which hybridises to the polyA tail of 
mRNA; 

b) Random short sequences, such as random hexamers, which 
prime cDNA synthesis of the total RNA; or 

20 c) Oligonucleotides specific for CK20 which prime cDNA 

synthesis from CK20 mRNA. 



4. Optionally any residual chromosomal DNA is removed using a 
nuclease such as DNase I. 

25 

5. Oligonucleotide primers are selected which direct specific 
amplification of the CK20 cDNA. A PCR is performed using these 
oligonucleotides, a thermal-stable DNA polymerase and 
deoxynucleotides. 



30 
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6. Optionally, further oligonucleotides are selected which direct 
specific amplification of a DNA fragment chosen from the inter- 
primer region defined in step 5 and a further PCR (a "nested" 
PCR) is carried out. 

5 

7. The product of the PCR reaction of step 5 or 6 is detected using 
agarose gel electrophoresis and ethidium bromide staining of the 
DNA. 

10 Nucleotide sequences of CK20 cDNAs are shown in Figures 5 to 7. The 
cDNA and gene sequence described in Moll et al (1993) Differentiation 
53, 75-93 is incorporated herein by reference as are the sequences and 
information from the relevant database submissions described in the 
legends to Figures 5 to 7. The approximate position of the introns (in the 

15 gene) are marked. It is preferred if oligonucleotides suitable for DNA 
amplification comprise a sequence selected from the sequence shown in 
any one of Figures 5 to 7 such that the first oligonucleotide is capable of 
hybridizing to one exon and the second oligonucleotide is capable of 
hybridizing to another exon. Preferably the oligonucleotide primers are 

20 between 10 and 50 nucleotides in length, more preferably between 14 and 
30 nucleotide long, most preferably around 20 nucleotides long. 

It is preferred if the oligonucleotide primers can hybridize to all alleles of 
the CK20 gene. Regions of the CK20 gene and cDNAs common to all 
25 alleles can be determined by comparing the sequences of CK20 genes and 
cDNAs such as those shown in Figures 5 to 7. 

Preferred oligonucleotide primers comprise or consist of the sequence 5'- 
C AG AC AC ACGGTG A ACTATGG-3 ' (SEQ ID No 1) and 5'- 
30 G ATC AGCTTCC ACTGTTAG ACG-3 ' (SEQ ID No 2) . Further preferred 
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oligonucleotide primer pairs are (all listed 5' -* 3'): 

CTCCTG G A ATCTC C A ATG G (SEQ ID No 3) and 
GCATTTTGCAGTTGAGCATCC (SEQ ID No 4); 

5 

CTCC A ATG G ATTTC AGTCG (SEQ ID No 5) and 
AATTTGCAGGACACACCGAGC (SEQ ID No 6); 

CTAAATGACCGTCTAGCGAGC (SEQ ID No 7) and 
10 TCCACATTGACAGTGTTGCCC (SEQ ID No 8); 

CCAACTCCAAACTTGAAGTGC (SEQ ID No 9) and 
TCCACATTGACAGTGTTGCCC (SEQ ID No 10); and 

15 TGGGCAACACTGTCAATGTGG (SEQ ID No 11) and 
TCC ATGTT A CTC C G A ATCTG C (SEQ ID No 12). 

It is well known that the sequence at the 5' end of the oligonucleotide 
need not match the target sequence to be amplified. 

20 

It is usual that the PCR primers do not contain any complementary 
structures with each other longer than 2 bases, especially at their 3' ends, 
as this feature may promote the formation of an artifactual product called 
"primer dimer". When the 3' ends of the two primers hybridize, they 
25 form a "primed template" complex, and primer extension results in a short 
duplex product called "primer dimer". 



30 



Internal secondary structure should be avoided in primers. For symmetric 
PCR, a 40-60% G + C content is often recommended for both primers, 
with no long stretches of any one base. The classical melting temperature 
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calculations used in conjunction with DNA probe hybridization studies 
often predict that a given primer should anneal at a specific temperature 
or that the 72 °C extension temperature will dissociate the primer/template 
hybrid prematurely. In practice, the hybrids are more effective in the 
5 PCR process than generally predicted by simple T m calculations. 

Optimum annealing temperatures may be determined empirically and may 
be higher than predicted. Taq DNA polymerase does have activity in the 
37-55°C region, so primer extension will occur during the annealing step 
10 and the hybrid will be stabilized. The concentrations of the primers are 
equal in conventional (symmetric) PCR and, typically, within 0.1- to 1- 
/xM range. 

As an alternative to detecting the product of DNA amplification using 
15 agarose gel electrophoresis and ethidium bromide staining of the DNA, it 
is convenient to use a labelled oligonucleotide capable of hybridising to the 
amplified DNA as a probe. When the amplification is by a PCR the 
oligonucleotide probe hybridises to the interprimer sequence as defined by 
the two primers. The oligonucleotide probe is preferably between 10 and 
20 50 nucleotides long, more preferably between 15 and 30 nucleotides long. 
The probe may be labelled with a radionuclide such as 32 P, 33 P and 33 S 
using standard techniques, or may be labelled with a fluorescent dye. 
When the oligonucleotide probe is fluorescently labelled, the amplified 
DNA product may be detected in solution (see for example Balaguer et al 
25 (1991) "Quantification of DNA sequences obtained by polymerase chain 
reaction using a bioluminescence adsorbent" Anal. Biochem. 195, 105-1 10 
and Dilesare et at (1993) "A high-sensitivity electrochemiluminescence- 
based detection system for automated PCR product quantitation" 
BioTechniques 15, 152-157. 

30 
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Any of the DNA amplification protocols can be used in the method of the 
invention including the polymerase chain reaction, QB replicase and ligase 
chain reaction. The polymerase chain reaction is particularly preferred 
because of its simplicity. 

5 

The polymerase chain reaction may, if desired, be carried out in situ, or 
at least in whole tissue samples isolated from the patient, using the 
methods described in Komminoth et al (1994) Pathol. Res. Pract. 190(1), 
1017-1025, incorporated herein by reference; and Komminoth et al (1994) 
10 Verhandlungers der Deutschen Gesellschafi fur Pathologic 78, 146-152, 
incorporated herein by reference. 

In principle it is possible to detect the presence of a CK20 gene product 
by determining whether the sample contains any CK20 protein. This is 

15 most conveniently achieved using antibodies that react specifically with 
CK20. For example, monoclonal antibody K 5 20.8 reactive against CK20 
can be purchased from Cymbus Bioscience Limited, Southampton, UK or 
other suitable monoclonal antibodies can be made using methods well 
known in the art. For example, suitable monoclonal antibodies to CK20 

20 may be prepared using the techniques described in Monoclonal Antibodies: 
a manual of techniques, H. Zola (CRC Press, 1988) and in Monoclonal 
Hybridoma Antibodies: Techniques and Applications, J.G.R. Hurrell 
(CRC Press, 1982). 

25 Conveniently, the antibody is labelled with a readily detectable marker 
such as a radionuclide or fluorescent dye. Suitable radionuclides include 
"Tc, 123 I and n P. Suitable fluorescent dyes include fluorescein. 



30 



Once it has been determined, according to the methods of the invention 
whether a human patient has a tumour or is likely to develop metastatic 
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disease from such a tumour, the physician can decide on an appropriate 
course of treatment. Other methods may supplement the present method 
in order to reach a diagnosis. 

5 A further aspect of the invention provides a kit of parts comprising 
oligonucleotide primers capable of amplifying a cytokeratin 20 (CK20) 
cDNA, deoxynucleotides and a DNA polymerase. 



The kit may further comprise means for extracting RNA suitable primers 
10 for making CK20 cDNA. an enzyme with reverse transcriptase activity, 
and means for detecting an amplified DNA product. 

The invention will now be described in more detail with reference to the 
following Figures and Examples in which: 

15 

Figure 1 shows the products of RT-PCR for CK8, 19 and 20 mRNA on 
1 pg to 100 ng of total mRNA isolated from RT1 12 (CK8), MCF7 (CK19) 
and HT29 (CK20) cell lines. A single band of 244, 214 and 370 bp 
respectively was identified after separation of products in an agarose gel 
20 and staining with ethidium bromide. There was an increase in the 
intensity of this band with increasing RNA concentration. 

M = molecular weight markers, W = water control. 

25 Figure 2 shows the products of RT-PCR for CK8(i), 19(ii) and 20fiii) 
mRNA separated by agarose gel electrophoresis and stained with ethidium 
bromide in 6 control bloods (1-6). 

C = positive control for CK8, 19 or 20 mRNA detection. W = water 
30 negative control. M = molecular weight markers. 
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Figure 3 shows the products of RT-PCR for CK20 mRNA separated by 
electrophoresis and stained with ethidium bromide in 6 control bone 
marrow samples (i). RT-PCR for GAP-DH in the same RNA samples 
(ii). 

5 

C = positive control of RNA extracted from HT29 cells. W = water 
negative control. M = molecular weight markers. 

Figure. 4 shows the products of RT-PCR for CK20 mRNA separated by 
10 agarose gel electrophoresis and stained with ethidium bromide in blood 
samples spiked with 1 to 10 3 HT29 cells. A single 370 bp band was 
identified when as few as 100 cells per ml of whole blood were analysed 
(3). No band was identified in unspiked blood (O). RT negative samples 
showed no amplified band (ii). 

15 

+ C = positive control of RNA extracted from HT29 cells. W = water 
negative control. M = molecular weight markers. 

Figure 5 shows the nucleotide sequence of part of a human CK20 mRNA 
20 with the amino acid sequence of the encoded polypeptide composed of the 
exon sequence taken from Accession X73501, (bases 1 to 18061). 

Author: Zimbelmann, R; Title: Direct Submission; Journal: Submitted 
(14 Sep 1993) to the EMBL/GenBank/DDBJ databases. R. Zimbelmann, 
25 German Cancer Research Center, Division of Cell Biology, Im 
Neuenheimer Feld 280, 69254 Heidelberg, FRG. 

Intron positions are shown with a IL 



30 



Figure 6 shows the nucleotide sequence of part of a human CK20 mRNA 
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.with the amino acid sequence of the encoded polypeptide composed of the 
exon sequence taken from as in 

Authors: Moll, R., Zimbelmann, R., Goldschmidt, M.D., Keith, M., 
5 Laufere, J., Kasper, M. f Koch, P.J. and Franke, W.W; Title: The 
human gene encoding cyto keratin 20 and its expression during fetal 
development and in gastrointestinal carcinomas; Journal: Differentiation 
53(2), 75-93 (1993). 

10 Intron positions are shown with a II 

Figure 7 shows the nucleotide sequence of part of a human CK20 mRNA 
taken from Accession X73502 

15 Reference: 1 (bases 1 to 1461); Authors: Calnek, D. and Quaroni, A.; 
Title: Differential localization by in situ hybridization of distinct keratin 
mRNA species during intestinal epithelial cell development and 
differentiation; Journal: Differentiation 53(2), 95-104 (1993); Medline: 
93366035. 

20 

Reference: 2 (bases 1 to 1461); Author: Quaroni, A.; Title: Direct 
Submission; Journal: Submitted (07 Sep 1993) to the 
EMBL/GenBank/DDBJ databases. A. Quaroni, Cornell University, 724 
A Vet. Res. Tower, Section of Physiology, Cornell University, Ithaca, 
25 NY 14853, USA. 



WO 96/17080 PCT/GB95/027J4 

16 

Example 1; Detection of croo-frxpressinp ppit h eiiai cpiu in 
blood 

Materials and methods 

5 

Cell Lines. The three well-characterised human cell lines used in the 
study were the transitional cell carcinoma-derived RT112 cell line, the 
breast adenocarcinoma MCF-7 cell line and the colonic adenocarcinoma 
HT29 cell line. MCF-7 and HT29 cells express CK8, CK18 and CK19 
(Moll etal (1982) Cell 31, 11-24), whereas RT! 12 express CK8, CKI'8 
CK19 along with other CK isotypes characteristic of bladder epithelial 
cells (Wu et al (1982) Cell 31, 693-703). CK20, the most recently 
identified CK isotype, is expressed by HT29 cells, but not by MCF-7 cells 
(Moll et al (1992) Am. J. Pathol. 140, 427-447). All cell lines were 
15 maintained in a 1:1 mixture of DMEM and RPMI 1640 media 
supplemented with 5% foetal bovine serum and passaged using 0.25% 
trypsin in versene (0.02% EDTA). 



10 



20 



Blood and bone marrow samples. Normal blood or bone marrow 
samples were obtained from volunteers aged between 18 to 45 years. 
Samples were taken into EDTA. aliquoted into 2 ml volumes and frozen 
at -80°C until required for RNA extraction. 

RNA extraction. Total cellular RNA was extracted from cell lines, 
25 normal whole blood, normal bone marrow or spiked normal blood using 
Ultraspec™ RNA (Biogenesis, Bournemouth, UK) according to the 
manufacturer's instructions. 



30 



Reverse transcriptase-polymerase chain reaction (RT-PCR). The RT- 
PCR method used was based on that for the detection of neuroblastoma 
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cells (Burchill et al (1994) Im. J. Cancer SI, 671-675). Briefly, dilution 
curves of RNA were DNase treated and reverse transcribed to produce 
cDNA using a random hexamer primer. RT products were amplified by 
PCR for CK8, CK19 or CK20 (primer sequences are given in Table 1). 
5 RT-PCR products were analysed by agarose gel electrophoresis and 
ethidium bromide staining. Reverse transcriptase negative controls (RT- 
ve) in which reverse transcriptase enzyme was omitted were included for 
all RT-PCR reactions. Water negative controls (W) contained all 
components for the RT-PCR reaction but no targeted RNA. Where 
10 appropriate, positive controls ( + C) of RNA extracted from HT29, MCF7 
or RT112 cells were included. Molecular weight markers (<f>X 174 RF 
DNA/Hae III, Gibco BRL, Paisley, Scotland or 123 bp ladder, Pharmacia, 
Milton Keynes, UK) were included on all agarose gels. 

15 The quality of RNA was confirmed by amplification of cDNA for GAP- 
DH or 18S probed Northern blot analysis. All primers were purchased 
from Oswell DNA Services (Edinburgh, Scotland). 

Table 1. Primer sequences used for PCR amplification of CK8. CK19 
20 and CK20 



25 





Sense primer 


Antisense primer 


CK8 


AACAACCTTAGGCGGCAGCT 

(SEQ ID No 13) 


GCCTGAGGAAGTTGATCTCG 
(SEQ ID No 14) 


CK19 


GCGGGACAAGATTCTTGGTG 

(SEQ ID No 15) 


CTTCAGGCCTTCGATCTGCAT 

(SEQ ID No 16) 


CK20 


CAGACACACGGTGAACTATGG 

(SEQ ID No 1) 


GATCAGCTTCCACTGTTAGACG 
(SEQ ID No 2) 



Primer sequences for PCR were selected using the Dieffenbach Selection 
Programme. Primers were located within different exons and were either 
20, 21 or 22 mers. 
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Specificity of RT-PCR. RT-PCR products were separated on agarose 
gels and Southern blotted onto nylon membrane (Hybond N + , Amersham, 
UK). Filters were hybridised with a gamma »P end-labelled 
oligonucleotide probe, the sequence of which lay between each primer set. 
5 The nucleotide sequence of RT-PCR products was confirmed by dideoxy 
chain termination sequencing (Sequenase, USS, Canada). 

Cell spiking. Cell spiking experiments were used to test the potential 
sensitivity of this technique for detection of colon carcinoma cells in 
10 blood. Known numbers of HT29 cells were added to whole blood 
samples, mRNA extracted and RT-PCR for CK20 performed. To 2 ml 
aliquots of whole blood 10 to 1 x 10 6 cells were added; an unspiked blood 
sample was included in each experiment. RNA (100 pg) from HT29 cells 
was included as a positive control. 



15 



Results 



RT-PCR detection of bladder, breast and colon carcinoma cells. RT- 
PCR for CK8 generated a single 244 bp band identified on ethidium 
20 bromide stained agarose gels (Fig I.i). This fragment was confirmed by 
Southern blot analysis and sequencing (data not shown) to be the fragment 
of CK8 which lies between the two primers used for PCR. The band was 
detected in 10 pg-100 ng of total RNA from RT112 cells. 

25 RT-PCR for CK19 generated a single 214 bp band identified on ethidium 
bromide stained agarose gels (Fig 1 ,ii) which was confirmed to be CK19 
by Southern blot analysis and sequencing (data not shown). This band 
was detected in 100 pg-1 ng of total RNA from MCF7 cells. 



30 



RT-PCR for CK20 generated a single band of 370 bp (Fig I, Hi). This 
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band was confirmed by Southern hybridisation and sequence analysis (data 
not shown) and detected in 100 pg of total RNA from HT29 cells. 

In all three cases there was an increase in band intensity with increasing 
5 amounts of RNA (Fig 1). No transcripts were identified in water control 
samples (Fig 1) or RT negative samples (results not shown). 

Control blood and bone marrow analysis. In 8/9 and 6/15 control blood 
samples analysed CK8 and CK19 RT-PCR products were identified under 

10 the described conditions. Southern blotting confirmed amplified bands 
were CK8 and CK19 RT-PCR products. In 15/15 control blood samples 
analysed, CK20 was undetectable by ethidium bromide staining or 
Southern blot hybridisation. RT-PCR results for CK8(i), CK19(ii) or 
CK20(iii) are shown in Fig 2 for six control bloods, 6/6 were positive for 

15 CK8, 3/6 for CK19 and 0/6 for CK20. RT-PCR for CK20 in 6/6 normal 
bone marrow samples showed no amplified bands (Fig 3,i). The integrity 
of bone marrow RNA samples was confirmed by RT-PCR for GAP-DH 
(Fig 3,ii). 

20 Cell spiking. In HT29 cell spiking experiments it was possible to detect 
down to 100 HT29 cells diluted in 2 ml of whole human blood (Fig 4,i). 
The 370 bp band generated was shown by Southern blotting to hybridise 
to a 32 P end-labelled oligonucleotide probe specific for CK20 and 
confirmed by sequence analysis to be that of CK20 (results not shown). 

25 No RT-PCR products were detected in whole blood alone (Fig 4,i 0). 
RT-PCR products were not identified in reverse transcriptase negative 
samples (Fig 4,ii). 

Since CK8 and CK19 were expressed in a high proportion of normal 
30 peripheral blood samples (88% and 40% respectively) neither would be 
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suitable targets for detection of tumour cells in peripheral blood. 



CK20 mRNA was not detected in any normal blood or bone marrow 
samples examined suggesting it to be the CK of choice for detection of 
5 carcinomas of epithelial origin. CK20 has been detected in almost all 
cases of colorectal adenocarcinomas by immunohistochemistry and is a 
useful target for the detection of disseminating colon carcinoma by RT- 
PCR. 

10 Exa mp!g 3; Determining whether a patient h qs an ppjf h f lipj r? \\ 
tumour 

5 ml of blood is removed from the patient. RNA is produced from the 
blood as described in Example 1 and a RT-PCR is carried out using the 
15 CK20-specific primers as described in Example 1 . If a CK20-specific 
DNA product is amplified it suggests that the patient may have an 
epithelial cell tumour. Further confirmatory tests may be performed in 
order to reach a diagnosis. 

20 Example 3: Determining whether a patient with colorectal carcinoma 
tumours i s likely to develop metastases 

5 ml of blood is removed from the patient. RNA is produced from the 
blood as described in Example 1 and a RT-PCR is carried out using the 
25 CK20-specific primers as described in Example 1. If a CK20-specific 
DNA product is amplified it suggests that the patient is likely to develop 
metastatic disease disseminated from the colorectal tumour. Further 
confirmatory tests may be performed in order to reach a diagnosis. 
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CLAIMS 

1. A method of determining whether a human patient has a tumour or 
whether a tumour has metastasised comprising the steps of (1) obtaining 
5 a sample of tissue from the patient, the said tissue being one that does not 
normally contain a cytokeratin 20 (CK20) gene product and (2) 
determining whether a cytokeratin 20 (CK20) gene product is present in 
said tissue sample. 

10 2. A method according to Claim I wherein the tumour is an epithelial 
cell tumour. 

3. A method according to Claims 1 or 2 wherein the tissue is blood 
or bone marrow. 

15 

4. A method according to any one of Claims 1 to 3 wherein the 
epithelial cell tumour is any one of colorectal adenocarcinoma, stomach 
adenocarcinoma, mucinous ovarian adenocarcinoma, bladder carcinoma, 
gall bladder adenocarcinoma and transitional cell carcinoma. 

20 

5. A method according to Claim 4 wherein the epithelial cell tumour 
is colorectal adenocarcinoma. 

6. A method according to any one of the preceding claims wherein the 
25 presence of the cytokeratin 20 (CK20) gene product is determined by 

detecting cytokeratin 20 (CK20) messenger RNA (mRNA). 



30 



7. A method according to Claim 6 wherein the cytokeratin 20 
messenger RNA (mRNA) is copied into complementary DNA (cDNA). 
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. 8. A method according to Claim 7 wherein the complementary DNA 
(cDNA) is amplified. 



9. A method according to Claim 8 wherein the complementary DNA 
5 (cDNA) is amplified using the polymerase chain reaction (PCR). 

10. A method according to any one of Claims 6 to 9 wherein genomic 
DNA is removed from or cleaved in the said sample prior to detecting 
cytokeratin 20 (CK20) messenger RNA (mRNA). 

0 

11. A method according to Claim 9 or 10 wherein the primers used in 
the polymerase chain reaction (PCR) comprise or consist of the sequences 
5'-CAGACACACGGTGAACTATGG-3' (SEQ ID No 1) and 5'- 
GATCAGCTTCCACTGTTAG ACG-3 ' (SEQ ID No 2). 



12. A kit of parts comprising oligonucleotide primers for amplifying a 
cytokeratin 20 (CK20) cDNA, deoxynucleotides and a DNA polymerase. 
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1 T 

2 GTCAGCAGAG GAGGAGTTTC TTGCCTGTGG ACTTCATAAA AGGCTAGCTC 



52 
102 



AACACCCTCC ATGAGACACA CTCTGCCCCA ACCATCCTGA AGCTACAGGT 

GCTCCCTCCT GGAATCTCCA ATGGATTTCA GTCGCAGAAG CTTCCACAGA 

MDFSRR SFHR 

152 AGCCTGAGCT CCTCCTTGCA GGCCCCTGTA GTCAGTACAG TGGGCATGCA 
SLSS SLQ APVVSTVGMQ 

202 GCGCCTCGGG ACGACACCCA GCGTTTATGG GGGTGCTGGA GGCCGGGGCA 
RLG TTPS VYGGAGG R 6l 

252 TCCGCATCTC CAACTCCAGA CACACGGTGA ACTATGGGAG GGATCTCACA 
R i S NSRHTVN YGS DLT 

302 GGCGGCGGGG ACCTGTTTGT TGGCAATGAG AAAATGGCCA TGCAGAACCT 
G G G D LFVGNE KMAMQNL 

352 AAATGACCGT CTAGCGAGCT ACCTAGAAAA GGTGCGGACC CTGGAGCAGT 
NDR LASYLEK VRT LEQS 

402 CCAACTCCAA ACTTGAAGTG CAAATCAAGC AGTGGTACGA AACCAACGCC 
NSK LE V Q IK Q WYE T N A 

452 CCGAGGGCTG GTCGCGACTA CAGTGCATAT TACAGACAAA TTGAAGAGCT 
PRAG RDYSAY YRQI EEL 

502 GCGAAGTCAGIIATTAAGGATG CTCAACTGCA AAATGCTCGG TGTGTCCTGC 
RS Q I KDAQLQ NARCVLQ 

552 AAATTGATAA TGCTAAACTG GCTGCTGAGG ACTTCAGACT GAA||GTATGAG 
1 D N A K LAA EDFRL K YE 

602 ACTGAGAGAG GAATACGTCT AACAGTGGAA GCTGATCTCC AAGGCCTGAA 
TE RG IRLTVEADLQGLN 

652 TAAGGTCTTT GATGACCTAA CCCTACATAA AACAGATTTG GAGATTCAAA 
KV F D DLT LHK TDLE1 Ql 

TTGAAGAACT GAATAAAGAC CTAGCTCTCC TCAAAAAGGA GCATCAGGAGII 
EEL NKDLALL KKE HQE 



702 



752 GAAGTCGATG GCCTACACAA GCATCTGGGC AACACTGTCA ATGTGGAGGT 
EVDG LHKHLGNTVNVEV 



802 



TGATGCTGCT CCAGGCCTGA ACCTTGGCGT CATCATGAAT GAAATGAGGC 
DAA PGLN LGV1MN EMRQ 

852 AGAAGTATGA AGTCATGGCC CAGAAGAACC TTCAAGAGGC CAAAGAACAG 
KYE VMAQKNLQEAKEQ 



Figure 5 (sheet 1 of 2) 
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902 TTTOAGAGAC AG,|ACTGCAGT TCTGCAGCAA CAGGTCACAG TGAATACTGA 
rtRQ TAVLQQQVTVNTE 

952 AGAATTAAAA GGAACTGAGG TTCAACTAAC GGAGCTGAGA CGCACCTCCC 
ELKGTEV QLTELRRTSQ 

•002 AGAGCCTTGA GATAGAACTC CAGTCCCATC TCAGCATG||AA AGAGTCTTTG 
aLE IE LOSHL S. \s v . 



E S L 



1052 GAGCACACTC TAGAGGAGAC CAAGGCCCGT TACAGCAGCC AGTTAGCCAA 

1 102 CCTCCAGTCG CTGTTGAGCT CTCTGGAGGC CCAACTGATG CAGATTCGGA 
LQSLLSS LEAQLM .QIrs 

^N^^^^E * ' 4% ^^q A ^ A ^^' ^^y^^^^^^^ TGACATAAAG 

ACTCG ACTTG AACAGGAAAT TGCTACTTAC CGCCGCCTTC TGGAAGGAGA 
Q £ 1 ATY RRLL EGE 

AGACGTAAA„A ACTACAGAAT ATC^GTTAAG CACCCTGGAA GAGAGAG^ATA 

TAAAGAAAAC CAGGAAGATT AAGACAGTCG TGCAAGAAGT AGTGGATGGC 
R K I KTVV QEVVDG 

AAGGTCGTGT CATCTGAAGT CAAAGAGGTG GAAGAAAATA TC[TAA|| AT AGC 

TACCAGAAGG AGATGCTGCT GAGGTTTTGA AAGAAATTTG GCTATAATCT 
TATCTTTGCT CCCTGCAAGA AATCAGCCAT AAGAAAGCAC TATTAATACT 
CTGCAGTGAT TAGAAGGGGT GGGGTGGCGG G AATCCT ATT TATCAGACTC 
TGTAATTG AA TATAAATGTT TTACTCAGAG GAGCTGCAAA TTGCCTGCAA 
AAATGAAATC CAGTGAGCAC TAGAATATTT AAAACATCAT TACTGCCATC 
TTTATCATGA AGCACATCAA TTACAAGCTG TAGACCACCT AATATCAATT 
TGTAGGTAAT GTTCCTGAAA ATTGCAATAC A TTCAATTA TA CT AAACCT 
CACAAAGTAG AGGAATCCAT GTAAATTGCA AATAAA 



1152 
1202 
1252 
1302 
1352 



1402 
1452 
1502 
1552 
1602 
1652 
1702 
1752 



Figure 5 (sheet 2 or 2) 



SUBSTITUTE SHEET (RULE 26} 



WO 96/17080 PCT/GB95/02734 

7/9 

! T 

2 GTCAGCAGAG GAGGAGTTTC TTGCCTGTGG ACTTCATAAA AGGCTAGCTC 

52 AACACCCTCC ATGAGACACA CTCTGCCCCA ACCATCCTGA AGCTACAGGT 

102 GCTCCCTCCT GGAATCTCCA ATGGATTTCA GTCGCAGAAG CTTCCACAGA 

MDFSRR S FHR 

152 AGCCTGAGCT CCTCCTTGCA GGCCCCTGTA GTCAGTACAG TGGGCATGCA 
S L S S SLQ APVVSTVGMQ 

202 GCGCCTCGGG ACGACACCCA GCGTTTATGG GGGTGCTGGA GGCCGGGGCA 
R L G TTPS VYGG AGGRGI 

252 TCCGCATCTC CAACTCCAGA CACACGGTGA ACTATGGGAG CGATCTCACA 
K * SNSR HTVN YGS DLT 

302 GGCGGCGGGG ACCTGTTTGT TGGCAATG AG AAAATGGCCA TGCAGAACCT 
G GG D LFV GNE K M A M Q NL 

352 AAATGACCGT CTAGCGAGCT ACCTAGAAAA GGTGCGGACG CTGGAGCAGT 
ND R LAS Y L EK VRT LE QS 

402 CCAACTCCAA ACTTGAAGTG CAAATCAAGC AGTGGTACGA AACCAACCGC 
N S K LEVQIKQWYE TNR 

452 CCGAGGGCTG GTCGCGACTA CAGTGCATAT TACAGACAAA TTGAAGAGCT 
PR AG R D Y SAY YRQI EE L 

502 GCG AAGTC AG|| ATTAAGG ATG CTCAACTGC A AAATGCTCGG TGTGTCCTGC 
RSQ I KDAQL Q narcvlo 



552 



702 



AAATTGATAA TGCTAAACTG GCTGCTGAGG ACTTCAGACT GAA||GTATGAG 
IDN A K LAAED FRLK YE 



602 ACTGAGAGAG GAATACGTCT AACAGTGGAA GCTGATCTCC AAGGCCTGAA 
TE RG I RLT VE ADLQ G LN 

652 TAAGGTCTTT GATGACCTAA CCCTACATAA AACAGATTTG GAGATTCAAA 
KVF DDLT LHK TDLEI QI 



TTGAAGAACT GAATAAAGAC CTAGCTCTCC TCAAAAAGGA GCATCAGGAGll 
E EL NKDLALL KKE HQE 



752 GAAGTCGATG GCCTACACAA GCATCTGGGC AACACTGTCA ATGTGGAGGT 
E V D G LHKHLG N T V N VEV 

802 TGATGCTGCT CCAGGCCTGA ACCTTGGCGT CATCATGAAT GAAATGAGGC 
DAA PGLN LGVIMN EMRQ 

Figure 6 (sheet 1 of 2) 
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852 



902 



952 
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AGAAGTATGA AGTCATGGCC CAGAAGAACC TTCAAGAGGC CAAAGAACAG 
KVEVMAQKNLQEAKEQ 

TTTGAGAGAC AG||ACTGCAGT TCTGCAGCAA CAGGTCACAG TGAATACTGA 
FERQ TAVLQ QQVT VNTE 



AGAATTAAAA GGAACTGAGG TTCAACTAAC GGAGCTGAGA CGCAGCTCCC 
£ LK GTEV QL T ELR RTSQ 

1002 AGAGCCTTGA GATAGAACTC CAGTCCCATC TCAGCATG||AA AGAGTCTTTG 
SLE IE LQSHL SMK ESL 

1052 GAGCACACTC TAGAGGAGAC CAAGGCCCGT TACAGCAGCC AGTTAGCCAA 
tHTLEET KARYSSQLAN 

CCTCCAGTCG CTGTTGAGCT CTCTGGAGGC CCAACTGATG CAG ATTCGGA 
LQSLLSS LEA Q L M Q I R s 

GTAACATGGA ACGCCAGAAC AACGAATACC^ATATCCTTCT TGACATAAAG 

ACTCGACITG AACAGGAAAT TGCTACTTAC CGCCGCCTTC TGGAAGGAGA 
TRLE QE I ATY RRLL EGE 

AGACGTAAAjlA ACTACAGAAT ATCAGTTAAG CACCCTGGAA GAGAGAG||ATA 

TAAAGAAAAC CAGGAAGATT AAGACAGTCG TGCAAGAAGT AGTGGATGGC 
KK TRKI KT VV QEVV - - 



1102 
1152 
1202 

1252 
1302 

1 



D G 



352 AAGGTCGTGT CATCTGAAGT CAAAGAGGTG GAAGAAAATA TCJT AA 1 1 AT AGC 
KV VS SEV KEVEENI • 

1402 TACCAGAAGG AGATGCTGCT GAGGTTTTGA AAGAAATTTG GCTATAATCT 

1452 TATCTTTGCT CCCTGCAAG A AATCAGCCAT AAGAAAGCAC TATTAATACT 

1502 CTGCAGTGAT TAGAAGGGGT GGGGTGGCGG GAATCCTATT TATCAGACTC 

1 552 TGT AATTGAA TATAAATGTT TTACTCAG AG G AGCTGC AAA TTGCCTGCAA 

1 602 AAATG AAATC CAGTGAGC AC TAG AATATTT AAAAC ATCAT TACTGCCATC 

1 652 TTTATCATG A AGCACATCAA TTAC AAGCTG TAG ACCACCT AATATCAATT 

1702 TGTAGGTAAT GTTCCTG AAA ATTGCAATAC A TTCAATTA TACTAAACCT 

1 752 CACAA AGTAG AGG AATCC AT GTAAATTGCA AAT AAA 

Figure 6 (sheet 2 of 2) 
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101 

151 

201 

251 

301 

351 

401 

451 

501 

551 

601 

651 



I, TGC GG ACCCTGGA GCAGTCCAAC TCCAAACTTG AAGTGCA A at 

« SJ^ G o GG TACGAA ACCA ACGCCCCGAG GGCTGGTCG? gIcta^A^G 

oi catattacag acaaattgaa gagctgcgaa gtcagattaa ggat?SST G 

CTGCAAAATG CTCGGTGTGT CCTGCAAATT GATAATcSJ JJ^SI^ 

tgaggacttc agactgaagt atgagagtga ?aga^ga^?a 
? G S^ GCTGA TCTCCAAGGG ctgaataagg tc?4?ga^ga otSc^Sa 
c^aaaacag atttggagat tcaaattgaa gaactgaata aagacctagc 
tctcctcaaa aaggagcatc aggaggaagt CGATGGCCTA ^caaSJSS 

Gr?r?^i C ^"GTCAATGTG GAGGTTGATG CTGCTCCAGG £££SS5 
GGCGTCATCA TGAATGAAAT GAGGCAGAAG TATGAAGTCA TGGCCCaSI 
^CCTTCAA GAGGCCAAAG AACAGTTTGA GAGACAGAcJ G^aSSctg^ 

agcaacaggt cacagtgaat actgaagaat taaaaggaac tSgcwS? 

^SSJ GC TGAGAC GCAC. CTCCCAGAGC CTTGAGATAG E£S?SSJ£ 
CCATCTCAGC ATGAAAGAGT CTTTGGAGCA CACTCTArAr ^T-^JS. 
751 rS^^ 0 CAGCCAG ™ CCCAACSS £S£S££ GAGcJc^G 
2o, ™££f CAAC TGATGCAG AT TCGGAGTAAC ATGGAACGCC CGA^cXaCGA 
J?f ™°S ATATC CTTCT T G ACA TAAAGACTCG ACTTGAACAG GAAATTGCTA 
851 CTTACCGCCG CCTTCTGGAA GGAGAAGACG TAAAAACTAC a7aYJ7$ST A 
901 TTAAGCACCC TGGAAGAGAG AGATATAAAG AAAAcJaJcA JSJJSilSS 
9 51 AGTCGTGCAA GAAGTAGTGG ATGGCAAGGT CGTGTCATCT JfJSSJfiS 
loH ^?r GGAAGA AAATATCTAA ATAGC^AG SSSSSS 

^^ GAAA TTTGGCTATA ATCTTATCTT TGCTCCCTGC SSSSSE 
1101 CCATAAGAAA GCACTATTAA TACTCTGCAG TGATTAGAAG GGG^gSg 
iili JCGGGAATCC TATTTATCAG ACTCTGTAAT TGAATATAAA TGT^aSc 
1201 AGAGGAGCTG CAAATTG CCT GCAAAAATGA AATCCAATGA SlSji™ 
IIH tZ£5F* C * "^GATTACTGC CaJSSES i£2S££ SS5JS£ 

i?SJ- G 5T GTAGACC acctaatatc aatttgtagg taatgttcct gaaaatJg^ 

1351 ATACATTTCA ATTATACTAA ACCTCACAAA GTAGAGGAAT CcllSJIffi 
"2* A^A^aI f ACTTrCTA ATTTTTAAAA AAAAAAAAAA £a£Ia£I 

/translation="EKVRTLEQSNSKLEVQIKQWYETNAPRAGRDYSAYYRQIEELRS 

QIKDAQLQNARCVLQIDNAKLAAEDFRLKYESERGIRLTVEADLQGLNKVFDDLTLHK 

TDLEIQIEELNKDLALLKKEHQEEVDGLHKHLGNTVNVEVDAAPGLNLGVIMNEMRQK 

YEVMAQKNLQEAKEQFERQTAVLQQQVTVNTEELKGTEVQLTELRRTSQSLEIELQSH 

LSMKESLEHTLEETKARYSSQLANLQSLLSSLEAQLMQIRSNMERPNNEYHILLDIKT 

RLEQEIATYRRLLEGEDVKTTEYQLSTLEERDIKKTTKIKTWQEWDGKWSSEVKE 
VEEN I" 
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